GOjen: tdGo Temporal Difference Learning of Go Playing Artificial Neural Networks

نویسنده

  • Thomas Stütz
چکیده

The original project description has been: An existing Java application handling and visualizing Go games between human and computer players (including trained and evolved ANNs) should be improved and extended with Go playing ANNs trained by temporal difference learning. This extension should serve as a basis for comparisons of td learning with conventional ANN training and evolutionary methods. The ancient eastern board game Go has been proven to be of high complexity. Therefore, conventional computer programs are still far away from world class player level. These properties make the game interesting for applying and testing new approaches, namely temporal difference learning and artificial neural networks (ANNs).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning to Evaluate Go Positions via Temporal Difference Methods

The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation extremely difficult. Development of conventional Go programs is hampered by their knowledge-intensive nature. We demonstrate a viable alternative by training neural networks to evaluate Go positions via temporal differe...

متن کامل

Temporal Difference Learning of Position Evaluation in the Game of Go

The game of Go has a high branching factor that defeats the tree search approach used in computer chess, and long-range spatiotemporal interactions that make position evaluation extremely difficult. Development of conventional Go programs is hampered by their knowledge-intensive nature. We demonstrate a viable alternative by training networks to evaluate Go positions via temporal difference (TD...

متن کامل

Learning to Play Draughts using Temporal Difference Learning with Neural Networks and Databases

This paper describes several aspects of using temporal difference learning (TD) and neural networks to learn game evaluation functions, and the benefits of using databases. Experiments in tic-tac-toe and international draughts have been done to measure the effectiveness of using databases. The experiment of Tic-TacToe showed that training from database games resulted in better play than learnin...

متن کامل

Evaluation in Go by a Neural Network using Soft Segmentation

In this article a neural network architecture is presented that is able to build a soft segmentation of a two-dimensional input. This network architecture is applied to position evaluation in the game of Go. It is trained using self-play and temporal difference learning combined with a rich two-dimensional reinforcement signal. Two experiments are performed, one using the raw board position as ...

متن کامل

The Integration of A Priori Knowledge into a Go Playing Neural Network

The best current computer Go programs are hand crafted expert systems. They are using conventional AI technics such as pattern matching, rule based systems and goal oriented selective search. Due to the increasing complexity of managing this kind of knowledge representation by hand, the playing strength of these programs is still far from human master level. This article describes methods for i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004